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Abstract 

Background: Luba is one of the four historical foci of Human African Trypanosomiasis (HAT) on Bioko Island, in 
Equatorial Guinea. Although no human cases have been detected since 1995, T. b. gambiense was recently 
observed in the vector Glossino palpalis palpalis. The existence of cryptic species within this vector taxon has been 
previously suggested, although no data are available regarding the evolutionary history of tsetse flies populations 
in Bioko. 

Methods: A phylogenetic analysis of 60 6. p. palpalis from Luba was performed sequencing three mitochondrial 
(COI, ND2 and 16S) and one nuclear (rDNA-ITSI) DNA markers. Phylogeny reconstruction was performed by 
Distance Based, Maximum Likelihood and Bayesian Inference methods. 

Results: The COI and ND2 mitochondrial genes were concatenated and revealed 10 closely related haplotypes with 
a dominant one found in 61.1% of the flies. The sequence homology of the other 9 haplotypes compared to the 
former ranged from 99.6 to 99.9%. Phylogenetic analysis clearly clustered all island samples with flies coming from 
the Western African Clade (WAC), and separated from the flies belonging to the Central Africa Clade (CAC), 
including samples from Mbini and Kogo, two foci of mainland Equatorial Guinea. Consistent with mitochondrial 
data, analysis of the microsatellite motif present in the ITS1 sequence exhibited two closely related genotypes, 
clearly divergent from the genotypes previously identified in Mbini and Kogo. 

Conclusions: We report herein that tsetse flies populations circulating in Equatorial Guinea are composed of two 
allopatric subspecies, one insular and the other continental. The presence of these two G. p. palpalis cryptic taxa in 
Equatorial Guinea should be taken into account to accurately manage vector control strategy, in a country where 
trypanosomiasis transmission is controlled but not definitively eliminated yet. 
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Background 

Following the London Declaration on Neglected Tropical 
Diseases, Human African Trypanosomiasis (HAT) has been 
targeted for elimination by 2020 [1]. As Trypanosoma 
brucei gambiense infection actually causes 98% of the total 
HAT cases (the remaining are due to T. b. rhodesiense), 
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attention must be focused on this subspecies. Among the 
elimination strategies, vector control can play an important 
role [2-4], especially in isolated populations, which can be 
targeted for direct intervention avoiding the reinvasion 
from neighbouring zones. Islands represent an ideal setting 
for such strategies as demonstrated by the eradication of 
Glossina sp. in Unguja [5] and in Principe Islands [2] after a 
few years of sustained control. 

Tsetse flies of the palpalis group (Nemorhina subgenus) 
are major vectors of T. b. gambiense in West Africa [2]. 
This group comprises two allopatric subspecies: G. p. 
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palpalis and G. p. gambiensis, which probably derived from 
an ancestral palpalis species which was isolated in several 
geographic points when its riverine habitat declined during 
the last glacial maximum [6,7]. Cumulative evidences sup- 
port the recognition of G. p. gambiensis and G. p. palpalis 
as valid specific taxa. For example, using data from the 
mitochondrial gene cytochrome oxidase 1 (COI), the aver- 
age genetic distance observed between G. p. palpalis and G. 
p. gambiensis sequences was 6.6%, which is well above the 
threshold of 2% divergence for inter-species comparisons 
[8-11]. Moreover, experimental crosses between these sub- 
species yielded sterile males in the offspring [12]. The 
phylogenetic situation is more complex since recent genetic 
analyses suggested the existence of at least two distinct 
cryptic species within G. p. palpalis [10,11]. One circulates 
in the Western part of Africa (named as Western African 
Clade or WAC), and the other in the continental part of 
Equatorial Guinea and the Democratic Republic of Congo 
(DRC) (Central African Clade or CAC). According to the 
available data, both types are sympatric in the Fontem focus 
of Cameroon [11]. 

In Equatorial Guinea, four historical HAT foci are 
classically defined: one insular (Luba, on Bioko Island) 
and three on the mainland (Rio Campo in the north; 
Mbini in the centre and Kogo in the south) [13]. Due to 
sustained control measures, less than 10 HAT cases are 
being detected every year in the three continental foci 
since 2009 and no cases have been recorded since 1995 
in Luba. Vector control activities were never imple- 
mented in Bioko Island, and parasite elimination in 
humans relied on active screening of the population at 
risk and systematic treatment [14]. Therefore, high dens- 
ities of G. p. palpalis have been observed in some local- 
ities at the south of Luba district and moderate densities 
in others of the epicentre of the focus [15]. Moreover 
the presence of T. b. gambiense has also been reported 
in tsetse flies of Luba despite the absence of human in- 
fections, which could be attributed to the existence of 
reservoirs in the wild fauna, cryptic human infections and/ 
or low sensitivity of available diagnostic tools [15-17]. 
Because vector control is a key parameter to completely 
eradicate the parasite [2,4,5] a deep knowledge of the biol- 
ogy of the tsetse fly is a crucial prerequisite. In such a con- 
text, the genetic characterization of the Glossina palpalis 
palpalis, never performed so far in Luba, has become 
indispensable. 

In this study we combine information from both mito- 
chondrial DNA (mtDNA- COI, ND2 and 16S- genes) 
and nuclear ribosomal (rDNA-ITSl) markers to investi- 
gate the phylogeographic origin of G. p. palpalis in the 
focus of Luba, Bioko Island, using tsetse flies samples cap- 
tured in a previous epidemiological study [15]. MtDNA has 
been extensively used in population and evolutionary biol- 
ogy of insects [18-20] and metazoa in general [21] due to 



their particular features: relative ease isolation, simple 
sequence organization, maternal inheritance, absence of 
recombination and rapid rate of sequence divergence allow- 
ing discrimination of recently diverged lineages [22]. On 
the other hand, the rDNA internal transcribed spacer 1 
(ITS1) is a useful marker for both closely related species 
and also intraspecific populations of insects [23-25]. 

Methods 

Sample collection 

Fly sampling was carried out in September/October 
2007 from five areas known to harbour G. p. palpalis 
(Avendano, Drumen, Fortuny, Boloco and Las Palmas). 
We employed monopyramidal traps [26], which have 
been successfully applied for vector control and entomo- 
logical surveys in Equatorial Guinea [27-29]. Details 
about trap distribution are provided elsewhere [15]. Tse- 
tse flies collected were individually stored in absolute 
ethanol in the field until processed in the laboratory. 
Species identification was undertaken using the key of 
Brunhes et al. [30]. Tsetse flies were sent to the National 
Centre of Tropical Medicine, Institute of Health Carlos 
III (Madrid, Spain) for subsequent molecular analysis. 

Molecular analysis 

DNA was extracted from whole flies with SpeedTools 
Tissue DNA Kit (Biotools, B & M Labs, S.A) following 
the manufacturer instructions. We analysed three mtDNA 
(ND2, COI and 16S) and one nuclear (ITS1) markers in 
our study. COI, 16S and ITS1 sequences were amplified 
with primers described previously [11,31] whereas new 
specific primers for ND2 gene from G. p. palpalis were 
designed (Additional file 1: Table SI). 

PCR reactions were performed with 2 \A of each template 
DNA, IX buffer (10 mM Tris-HCl, 1.5 mM MgCl 2 , 
50 mM KC1, pH 8.3), 100 (iM of each dNTP, 0.5 \iM of 
each primer, 1 U of Fast Start Taq DNA polymerase 
(Roche) and double distilled water (DDW) until reaching 
50 \A final volume. The thermal cycling programme started 
with initial denaturation step of 2 minutes at 95°C, followed 
by 35 cycles (30 seconds at 95°C, 30 seconds at 55-60°C 
and 1 minute at 72°C) and a final extension step of 5 mi- 
nutes at 72°C. Results were visualized in 2% agarose gel 
stained with ethidium bromide under UV irradiation. After 
this check, we sent the amplified products to Secugen 
Sequencing and Molecular Diagnostics (Madrid, Spain) 
where they were sequenced using Sanger method. 

Forward and reverse strands of all sequences were 
manually inspected with Sequence Scanner software vl.O 
(Applied Biosystems© 2005). Sequences were trimmed 
and aligned using ClustalW Multiple Alignment algo- 
rithm of BioEdit Sequence Alignment Editor version 
7.0.9.0 [32]. All sequences obtained in this work are 
available in GenBank with the following accession 
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numbers: KF597286-91 (COI), KF597292-5 (ND2), 
KF597296 (16S) and KF597297-8 (ITS1). 

Phylogenetic analysis 

MEGA version 5 software [33] was used to calculate the 
pairwise and average distances expressed as number of 
nucleotide substitutions per site. Phylogeny reconstruc- 
tion was performed with Maximum Likelihood (ML), 
Distance Based Neighbor-Joining (NJ) and Bayesian In- 
ference (BI) methods. Model of evolution was inferred 
from data using the Find DNA/Protein Model tool of 
MEGA5. This software implements the Bayesian Informa- 
tion Criterion (BIC) [34] and corrected Akaike Information 
Criterion (AIC) [35] to measure the goodness-of-fit of the 
24 models available. Unless otherwise specified, we applied 
the model selected by BIC to perform phylogenetic analysis 
(Table 1). For these three inference methods, all positions 
containing the missing data were eliminated. 

MEGA5 software was also used to construct ML and 
distance based trees of COI, ND2 and concatenated 
(COI + ND2) sequences. For ML, the bootstrap method 
(500 replications) was chosen to test the robustness of 
the trees [39]. The selected substitution model was ap- 
plied and rates among sites were treated as Gamma dis- 
tributed with five discrete categories. We assumed no 
invariant sites. The ML heuristic method employed was 
the Nearest-Neighbor-Interchange (NNI). 

Distance trees were constructed using the NJ method 
[40]. Gamma shape parameter was estimated from data. 
The bootstrap consensus tree was inferred from 2000 
replicates. 

BI was implemented with BEAST software [41]. This 
software uses Metropolis-Hasting Markov Chain Monte 
Carlo algorithm [42]. The default settings were generally 
used (lxlO 7 generations and log parameters sampled 
each 1000 steps). The first 25% of trees generated was 
discarded as 'burnin. Yule process was implemented as 
tree prior, a simple model appropriate when studying 
speciation [43,44]. 



Assessment of genetic diversity 

Under our sampling conditions, we assume that un- 
detected haplotypes can exist. In order to assess the 
number of these unseen haplotypes in our studied area 
(Luba focus), we used two estimators that calculate 
haplotype richness: i) the first-order Jackknife richness 
estimator [45] calculated as followed: Jackl = S 0 + al(N-l)/ 
N, where S 0 = observed number of haplotypes, al = number 
of haplotypes detected in only one fly, N = total number of 
flies and ii) the Bootstrap richness estimator [46] boot = 
S 0 + Z(l-pi) N , where S 0 = observed number of haplotypes, 
Pi = frequency of ith haplotype and N = total number of 
flies. Analysis was conducted in R software, with the 
specific Vegan package (http://cran.r-project.org, http:// 
vegan.r-forge.r-project.org). 

Phylogeographic analysis 

To infer the haplotype relationships within the data sets, 
the median- joining network algorithm available in NET- 
WORK v4.5. 1.0 was performed [47], which combines the 
topology of a minimum spanning tree with a parsimoni- 
ous search for the missing haplotypes. 

Results 

Mitochondrial markers 

We amplified the mtDNA of COI (622 bp), ND2 
(501 bp) and 16S (213 bp) genes for a total of sixty 
tsetse flies, all coming from Equatorial Guinea. Figure 1 
(COI + ND2) and Table 2 show the distribution of 
mtDNA haplotypes regarding the sampling location. In 
COI gene six different haplotypes were observed with an 
overall genetic distance of 0.4%, ranging from 0.2% to 
0.8%, in terms of number of base substitution per site. 
Haplotype 1 predominated over the rest (44/60), haplo- 
type 2 was found in 9 individuals, whereas the other 
haplotypes were found once or twice (Table 3). For 
ND2, fifty-four sequences were obtained and four haplo- 
types were detected with an average distance of 0.4%, 
ranging from 0.2% to 0.5%. Haplotype 1 was the most 



Table 1 Summary of models chosen in phylogenetic analysis 



Marker 
Inference method 


COI 




ND2 




COI + ND2 


NJ ML 


BI 


NJ ML 


BI 


NJ ML BI 


Model 


T92 + T 


TN93 + T 


T92 + T 


HKY + T 


TN93 + T 


BIC 


4097,84 


4101,20 


2186,19 


2193,06 


5311,13 


AIC 


3592,74 


3572,45 


2040,86 


2034,53 


5038,40 


-Lnl 


1732,16 


1719 


998,34 


993,16 


2484,13 


Length 


622 




501 




1123 


Number of haplotypes 


30 




9 




14 


Overall mean distance (SE) 


0.0176 (0.0062) 




0.0093 (0.0064) 




0.0149 (0.0053) 



NJ: distance-based Neighbor-Joining, ML: Maximum Likelihood, BI: Bayesian Inference, T92: Tamura 3-parameter model [36], TN93: Tamura-Nei model [37], HKY: 
Hasegawa-Kishino-Yano model [38], l~: assuming Gamma distribution (five discrete gamma categories), BIC: Bayesian Information Criterion (score), AIC: corrected 
Akaike Information Criterion (score), -Lnl: Maximum Likelihood value, SE: Standard Error. 
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common (45/54). Concatenated COI + ND2 yielded 10 
different haplotypes and revealed that 61.1% of individ- 
uals (33/54) shared the same pattern (haplotype 11, i.e. 
haplotype 1 at both COI and ND2). 

Analysis of genetic diversity seemed to be assessed 
with a reasonable accuracy. The number of estimated 
haplotypes, n = 13.93 and 11.86 calculated with Jackknife 
and Bootstrap estimators respectively, was close to the 
number of observed haplotypes (n = 10) when all 
the samples are considered as a unique population 
(Additional file 2: Table S2). This means that the 
haplotype structure of the population is correctly esti- 
mated even with low sample size and underestimation 
of the haplotype richness. 

Since very limited data are available regarding the 
ND2 and 16S sequences in G. p. palpalis from different 
geographic origins, we decided to focus the phylogen- 
etic reconstruction in COI gene. To construct more 



comprehensive ML, distance based NJ and BI trees we 
included previously published sequences of G. p. pal- 
palis from DRC (EU591840-2), Cameroon (EU591829- 
31, EU591860 and EU591865), Liberia (EU591857-9), 
Togo (EU591838-9), Ivory Coast (EU591846-8 and 
EU591832-3), Burkina Faso (EU591856) and two con- 
tinental foci of Equatorial Guinea, Kogo and Mbini 
(EU591825 and FJ767873-6). The source of these 
sequences is detailed elsewhere [10,11]. G. morsitans 
(GQ255905) was used to root the tree and G. p. gambiensis 
(EU591851) sequence was also placed as outgroup. Analysis 
was based on a total fragment of 622 bp of the COI gene. 

ML, distance based NJ and BI trees showed common 
topologies. All mirrored two major geographic separated 
clusters from Central (CAC) and Western Africa (WAC) 
(Figure 2). NJ and BI trees showed more support in this 
split: 97% bootstrap and 0.9999 posterior probability in 
BI trees, respectively, than ML inference (88%). Within 



Table 2 Distribution of the mtDNA haplotypes within Luba focus 



COI haplotypes 



Las Palmas 
Fortuny (Boloco) 
Fortuny 
B. Drumen 
B. Avendaho 
Total 



9 
7 

14 

6 

44 



17 
5 

45 



ND2 haplotypes 
2 3 

1 1 
1 

1 

1 

2 

5 2 



COI + ND2 haplotypes 



11 

6 
7 
5 

13 

2 

33 



21 



12 13 

1 1 
1 



31 



41 



14 51 



54 



61 



1 1 



For each location is detailed the number of times a given haplotype has been obtained. 
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Table 3 Distribution of ITS1 genotypes within Luba focus 



WAC, organisation was unclear and again NJ and BI 
methods yielded a more robust clustering. ML inference 
method failed to yield a clear discrimination between 
clades from Western Africa (50% bootstrap) whereas NJ 
and BI trees split samples from this area in different sis- 
ter sympatric groups (with 99% bootstrap and 1.0000 
posterior probability). Clusters obtained by the NJ and 
BI methods within the WAC are probably not artifacts 
since both trees separate exactly the same specimens 
and in spite of the lower power of the ML tree, the same 
trend is visible. 

Regardless of the inference method used the COI hap- 
lotypes from the Luba focus grouped within the WAC I 
cluster in all trees, in clear contrast with the haplotypes 
from Kogo and Mbini foci (mainland Equatorial Guinea) 
that belong to the CAC cluster. In accordance to 



Total number of samples 

0% 10 

3% 9 

0% 10 

2% 18 

3% 12 

4% 59 



phylogeny trees, network analysis shows the clear separ- 
ation of the two major geographic clusters (WAC and 
CAC) (Figure 3) together with the two subgroups corre- 
sponding to DRC and continental Equatorial Guinean 
haplotypes within the CAC (CAC I and II in Figure 2). 

Regarding ND2 gene, five haplotypes were available in 
Genbank, one from Ivory Coast (EU591895), one from 
Liberia (EU591884), two from Cameroon (EU591897-8) 
and one from Equatorial Guinea (EU591905). In accord- 
ance with COI data, ML, distance based NJ and BI trees 
constructed with ND2 and COI + ND2 concatenated 
genes (Figure 4), exhibited a clear separation between 
the sequences of mainland Equatorial Guinean haplo- 
types and those from the Luba focus. 

The 16S sequences were 100% identical in the 34 ran- 
domly selected individuals from Luba analysed by this 



ITS genotypes 



Village 



Genotype 1: (TA) 2 C (AT) 9 



Genotype 2: (TA) 10 



Las Palmas 
Fortuny/Boloco 
Fortuny 
B. Drumen 
B. Avendano 
Total 



7 

14 



44 



90,0% 
66,7% 
70,0% 
77,8% 
66,7% 
74,6% 



1 

3 
3 
4 
4 
15 



10, 

33, 
30, 
22, 
33, 
25, 



For each location is detailed the number of times a given genotype has been obtained. 



- LIB_15 

- CDI_5 
-LIB_17 

- CDI_4 

- CDI_6 

- CAM_35 

- CAM_33 

- EQG-LJ 

- EQG-L_4 

- EQG-L_3 

- CAM_30 

- CAM_34 

- EQG-L_5 

- EQG-LJ2 

- EQG-LJS 

- BUF_13 

- UB_14 

- CDIJ0 

- CDI_8 

- TOG_11 

- TOG_12 

- DRC_2 

- DRC_3 

- DRC_1 

- CAM_20 

- EQG-M_23 

- EQG-M_25 

- EQG-K_22 

- EQG-K_21 

- EQG-MJ24 

- G. p. gambiensis 

- G. morsitans 



Western African Clade I 



Other West African 
Clades 



Central African Clade I 



Central African Clade I 



Figure 2 Phylogenetic reconstruction inferred for the G. p. palpalis COI dataset. Majority rule of the maximum likelihood consensus, 
distance consensus and Bayesian trees inferred for the COI dataset. Models of evolution used are detailed in Table 1. Numbers shown next to the 
nodes are the support values, i.e. the percentage of bootstrap replications (500 replicates in maximum likelihood and 2000 in distance based 
analysis) and posterior probability in Bayesian trees. Analysis involved 622 positions in the final dataset. EQG-L Equatorial Guinea, Luba focus, 
EQG-M: Equatorial Guinea, Mbini focus, EQG-K: Equatorial Guinea, Kogo focus, DRC. Democratic Republic of Congo, CAM: Cameroon, LIB: Liberia, 
CDI: Cote d'lvoire, TOG: Togo, BUF: Burkina Faso. ns: no support (<50 bootstrap or 0.5 posterior probability). 
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B 



Western African Clade 



Schematic separation line 
between WAC/CAC 




O Burkina Faso 

# Togo 

# Cote d' I voire 
O Liberia 

# Eq. Guinea (Bioko) 

# Cameroon 

# DRC 

# Eq. Guinea (mainland) 



Central African Clade 



Figure 3 Geographic representation of the COI haplotypes. A) Geographic distribution of the COI lineages; B) Haplotypes network derived 
from 30 haplotypes of the G. p. polpolis complex. Haplotypes are represented by circles and their frequency is proportional to the area. Network 
diagrams created using Phylogenetic Network software from Fluxus and using the Median Joining method as described previously [47]. 



marker. Only one G. p. palpalis 16S sequence, from 
Cameroon, was available in GenBank (EU591913) before 
this work. Alignment of the single 16S haplotype found in 
Luba was performed with this Cameroonian sequence and 
other Glossina taxa, namely G. p. gambiensis (EU591911.1), 
G. fuscipes quanzensis (EU591910.1), G. fuscipes fuscipes 
(EU591906.1), G. tachinoides (EU591917.1), G. pallicera 
(EU591918.1), G. morsitans morsitans (EU591920.1) and G. 
pallidipes (EU591925.1) (Additional file 3: Figure SI). This 
alignment revealed 1.91% distance between G. p. palpalis 



from Luba and Cameroon, and distances ranging between 
2.4% (G. p. gambiensis) and 8.97% (G. pallidipes) when 
comparing Luba 16S haplotype with other Glossina taxa 
(Maximum Likelihood Composite Model [48]). 

Nuclear ITS1 marker 

ITS1 size polymorphism was assessed for 59 flies. All 
PCR products analysed exhibited a unique size around 
240 bp, similar to the size reported by Dyer et al [11] 
when this fragment was amplified from flies belonging 



B 



- EQG-L_3 

- EQG-L_1 

- EQG-L2 

- CDI_4 

- EQG-L_4 

- LIB_13 

- EQG-K_21 

- CAM_19 

- CAM_20 

- G. p. gambiensis 

- G. morsitans 



- EQG-L_1-1 

- EQG-LJ-4 

- EQG-LJ-3 

- EQG-L_4-1 

- EQG-L_3-1 

- EQG-L_5-1 

- EQG-L_2-1 

- EQG-L_6-1 

- EQG-L_5-4 

- EQG-LJ-2 

- CDI_4 

- LIB_13 

- CAM_20 

- EQG-K_21 

- G. p. gambiensis 

- G. morsitans 



Figure 4 Phylogenetic reconstruction inferred for the ND2 and concatenated COI + ND2 datasets. Majority rule of the maximum likelihood 
consensus, distance consensus and Bayesian trees inferred for the A) ND2 and B) concatenated COI + ND2 dataset. Models of evolution used are 
detailed in Table 1. Numbers shown next to the nodes are the support values, i.e. the percentage of bootstrap replications (500 replicates in 
maximum likelihood and 2000 in distance based analysis) and posterior probability in Bayesian trees. Analysis involved 501 (ND2) or 1 123 (COI + 
ND2) positions in the final dataset. EQG-L: Equatorial Guinea, Luba focus, EQG-K: Equatorial Guinea, Kogo focus, CAM: Cameroon, LIB: Liberia, 
CDI: Cote d'lvoire. ns: no support (<50 bootstrap or 0.5 posterior probability). 
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to the WAC After sequencing, two similar genotypes de- 
fined by the (AT) n microsatellite were found. Genotype 1 
was characterized by (AT) 9 G(TA) 2 region, whereas geno- 
type 2 exhibited (AT) 10 pure repeats (Additional file 4: 
Figure S2). As shown in Table 3, genotype 1 was dominant 
in all the 5 sampled areas of the Luba focus. 

The two genotypes found were aligned with ITS1 se- 
quences of G. p. palpalis from mainland Equatorial 
Guinea (Kogo: J767886 and Mbini: J767887-J767888), 
DRC (J767892 and J767893), Guinea Conakry (EU59 
1930), Gambia (EU591931), Burkina Faso (EU591932), 
Togo (EU591933), Liberia (EU591934) and Ivory Coast 
(EU5991935) (Additional file 4: Figure S2). The main 
genotype, found in 44/59 flies and defined by a (AT) 9 G 
(TA) 2 domain, was not detected in the other published 
sequences. The second genotype, defined by a (AT) 10 
repeat, shared 100% homology with the sequence of a 
Togo sample. 

Discussion 

Luba focus is located at the edge between the two main 
clades of G. p. palpalis, one named WAC (for Western 
Africa Clade), including flies from Cameroon, Burkina 
Faso, Ivory Coast, Liberia and Togo; and the other, 
named CAC (for Central African Clade) represented by 
flies from the Democratic Republic of Congo and the 
mainland region of Equatorial Guinea. Phylogenetic ana- 
lysis, using mtDNA markers, allowed us to cluster the G. 
p. palpalis population from Luba within the WAC. All 
the samples of this work were unambiguously included 
in this group and separated from the CAC regardless the 
phylogenetic inference method used. 

Our results are consistent with the geological history 
of Bioko, originated by volcanic eruptions in the lower 
Tertiary period, around 60 million years BP. Bioko is a 
part of an archipelago which pertains to a large volcanic 
fracture originating in the South of Lake Chad and ex- 
tending to Mount Cameroon in the continent [49]. Al- 
though politically belonging to Equatorial Guinea, Bioko 
lies closer to the Cameroon coast {ca. 30 km) rather 
than to the rest of Equatorial Guinea territories (more 
than 200 km from mainland and 700 km from Annobon 
island). During geoclimatic events of Quaternary period, 
Bioko was linked to mainland given the lower sea level, 
presumably to Cameroonian coast because of its geo- 
logical origin. At the end of the last glaciation (around 
12,000 years BP), Bioko was isolated by flooding and 
separated from the continent [50]. It is probable that G. 
p. palpalis population of Bioko was isolated from 
Cameroon coast after that event, as suggested for other 
insect vectors such as Anopheles melas and Simulium 
yahense [51,52]. 

Both mtDNA and nuclear markers show a very low gen- 
etic intra-variability. MtDNA genes data polymorphism did 



not exceed 0.8% and ITS1 sequences only yielded two 
closely related genotypes. For ITS1, the more abundant one 
is apparently exclusive of Luba focus, suggesting a certain 
degree of isolation. This is in accordance with previous 
studies, which found that G. p. gambiensis populations sep- 
arated from the continent only by 4-5 km of sea show clear 
evidences of complete isolation. These results are based on 
wing landmarks, DNA mitochondrial markers and micro- 
satellite dataset [53,54]. Our data should be also completed 
with an exhaustive coalescent population genetics analysis 
to support or not the hypothesis of an on-going allopatric 
speciation in Luba. 

Because of the presence of these two sympatric ITS1 
genotypes in Bioko Island one could speculate about the 
past occurrence of at least two separated migration 
events, as suggested for Anopheles gambiae in Bioko 
[55]. The ITS1 tandem array sequences are expected to 
be quickly homogenized by concerted evolution in inter- 
breeding populations, whereas differences are usually ob- 
served between non-interbreeding ones [56,57]. However, 
the low difference observed between our ITS1 genotypes is 
based in the AT repeats, where a slight heterogeneity can 
be expected since microsatellite sequence could be evolving 
faster than the homogenization process. This phenomenon 
has been previously described in the tandem repeated U2 
snRNA gene of primates [58,59]. 

Other possible explanation for the existence of two 
distinct ITS1 genotypes is a more recent reintroduction 
of tsetse flies from continent. Bioko Island is located at 
30 km West of the Cameroonian coast, the closest area 
of the continent. Estimated active dispersal of Glossina 
sp. in one day is no longer than 1.3 km (in 15-30 min/ 
day) [60,61]. Additionally, these flies are usually unable 
to flight for long periods but rather in short bursts last- 
ing between 1 and 2 minutes [62,63]. These observations 
make extremely unlikely a recent (posterior to the glaci- 
ations period) reproductive contact between Luba and 
other continental foci due to active dispersal. However, 
human-mediated transportation of flies may not be dis- 
regarded. This situation allowed the reinvasion of G. p. 
palpalis in Principe Island in 1956 despite its eradication 
in 1914 using mobile traps carried by workers [2]. The 
distance between this island {ca. 240 km) and mainland 
is much larger than that of Bioko. It is therefore ex- 
pected that human movement between Bioko and the 
coast of Cameroon has been much more frequent since 
the island was first colonized by the Bubi ethnical group, 
at the end of the last glaciation [64]. Finally, although 
the existence of ITS1 hybrid forms was initially sug- 
gested in G. p. palpalis of Equatorial Guinea [11], this 
hypothesis was later rejected by the same authors using 
microsatellite markers [10]. A set of highly polymorphic 
nuclear markers should be applied in the future to de- 
finitively test this hypothesis in Luba. 



Cordon-Obras et al. Parasites & Vectors 2014, 7:31 
http://www.parasitesandvectors.eom/content/7/1/31 



Page 8 of 10 



The genetic markers used to assess geographical struc- 
turing of G. p. palpalis demonstrate the existence of two 
allopatric taxa in Equatorial Guinea, one in the insular 
focus, Luba, and the other in two of the mainland foci 
(Kogo and Mbini). G. p. palpalis populations from Rio 
Campo, the third mainland focus of the country, have 
not been studied yet. In Equatorial Guinea, vector con- 
trol was implemented in Kogo and Mbini from 2002 to 
2009, whereas in Luba the active detection and treat- 
ment of HAT cases was the only control method 
employed [14]. Vector control has proved to be efficient 
at controlling HAT transmission [2,5,13] since tsetse 
flies show particular features that make them highly sus- 
ceptible to direct interventions. Firstly, tsetse flies have a 
very low reproductive rate, given that a female individual 
has probably only one reproductive mating in its life and 
deposits only one larva per generation (up to 12 genera- 
tions with 9-10 days of interval for a lifespan of 2- 
3 months) [2]. As a result of this low reproductive rate, 
the population of the vector is usually low comparing to 
other diptera and small increases in mortality can lead 
to control [4,65] or even to population extinction [66]. 
Secondly, the active dispersal of tsetse flies is generally 
low [60,61] resulting in a more difficult re-colonization 
of cleared areas. Thirdly, the genetic variability observed 
within populations tends to be reduced as well [67,68], 
probably as a consequence of both low reproductive rate 
and limited dispersal capacity, making more difficult for 
the selection of new attributes such as resistance to in- 
secticides. There are, however, other behavioural features 
such as feeding preferences or trap-avoidance that can 
vary at subspecies and even at population level. For ex- 
ample, within G. p. palpalis, diverse feeding preferences 
across foci of Cameroon have been observed [69]. Al- 
though this observation could be attributed to the op- 
portunistic feeding behaviour of G. p. palpalis [70], it 
could be also associated with the existence of genetically 
different G. p. palpalis populations, given the probable 
isolation of these foci [71]. Indeed, different feeding pat- 
terns in two sympatric populations of G. p. palpalis and 
G. p. gambiensis have also been described in preforest 
areas of Cote dTvoire [72], demonstrating that this fea- 
ture can differ between closely related subspecies. The 
feeding behaviour of the vector can be crucial to design 
an effective vector control campaign since it provides 
valuable information to understand the epidemiological 
cycle of the parasite at local level. 

Conclusions 

If cryptic species of G. p. palpalis are circulating as our 
data and previous observations suggest [11], the occur- 
rence of both variants in Equatorial Guinea could have 
an important impact in the control of sleeping sickness. 
Are vector control methods equally effective for all foci? 



Do tsetse flies from Kogo and Mbini have different feed- 
ing preferences than those of Luba? Could Luba tsetse 
fly populations feed mainly in wild fauna allowing T. b. 
gambiense to persist in the focus even with absence of 
human infections? Are both groups equally competent 
for trypanosome transmission? An entomological study 
comparing feeding behaviour, habitat selection and para- 
site strain specificity of G. p. palpalis populations from 
the island and the continent should be conducted to as- 
certain if the genetic differences observed could be 
reflected on the vector ecology. 
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